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Abstract 

The general problem of robust optimization is this: one of several possible scenarios will appear tomorrow 
and require to be covered, but things are more expensive tomorrow than they are today. What should you 
anticipatorily buy today, so that the worst-case cost (summed over both days) is minimized? We consider the 
k-robust model where the possible outcomes tomorrow are given by all demand-subsets of size k. 

In this paper, we give the following simple and intuitive template for fc-robust problems: having built some 
anticipatory solution, if there exists a single demand whose augmentation cost is larger than some threshold 
(which is w Opt/A: ), augment the anticipatory solution to cover this demand as well, and repeat. We show 
that this template gives improved approximation algorithms for fc-robust Steiner tree and set cover, and present 
the first approximation algorithms for fc-robust Steiner forest, minimum-cut and multicut. Our main technical 
contribution lies in proving certain net-type properties for these covering problems, which are based on dual- 
rounding and primal-dual ideas; these properties might be of some independent interest. All the approximation 
ratios (except for multicut) are nearly optimal. 

As a by-product of our techniques, we get algorithms for max-min problems of the form: "given a covering 
problem instance, which k of the elements are costliest to cover?" If the covering problem does not naturally 
define a submodular function, very little is known about these problems. For the problems mentioned above, we 
show that their fc-max-min versions have performance guarantees similar to those for the fc-robust problems. 



1 Introduction 

Consider the following k-robust set cover problem: we are given a set system (U, T C 2 U ). Tomorrow some set 
of k elements S C U will want to be covered; however, today we don't know what this set will be. One strategy 
is to wait until tomorrow and buy an 0(log re) -approximate set cover for this set. However, sets are cheaper today: 
they will cost A times as much tomorrow as they cost today. Hence, it may make sense to buy some anticipatory 
partial solution today (i.e. in the first-stage), and then complete it tomorrow (i.e. second-stage) once we know the 
actual members of the set S. Since we do not know anything about the set S (or maybe we are risk-averse), we 
want to plan for the worst-case, and minimize: 

(cost of anticipatory solution) + A • max (additional cost to cover S). 

Early approximation results for robust problems [ p| , |20| ] had assumed that the collection of possible sets S was 
explicitly given (and the performance guarantee depended logarithmically on the size of this collection). Since this 
seemed quite restrictive, Feige et al. JT4| ] proposed the fc-robust model where any of the (?) subsets S of size k 
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could arrive. Though this collection of possible sets was potentially exponential sized (for large values of k), the 
hope was to get results that did not depend polynomially on k. 



For the fc-robust set cover problem, Feige et al. [14] gave an 0(log m log ?7,)-approximation algorithm using the 



online algorithm for set cover within an LP-rounding-based algorithm (a la Q33Q). They also showed A>robust 
set cover to be O( lo 1 g ° 1 ^ m + logn) hard — which left a logarithmic gap between the upper and lower bounds. 
However, an online algorithm based approach is unlikely to close this gap, since the online algorithm for set cover 
is necessarily a log-factor worse that its offline counterparts [Q]. 

Closely related to the A>robust model are k-max-min problems, where given a covering problem instance the goal is 



to determine the fc-set of demands that are costliest to cover. Indeed, [14] used A>max-min set cover as a subroutine 
in their algorithm for A>robust set cover. If the covering problem defines a submodular objective then the A>max- 
min problem can be solved via constrained submodular optimization. However, most natural covering problems do 



not yield submodular functions, and so previous results cannot be applied directly. For A>max-min set cover Q14| ] 
used an online algorithm to obtain an 0(log m log n) -approximation algorithm. 

Apart from improving these results in context of set cover, one may want to develop algorithms for other A; -robust 
and A>max-min problems. E.g., for the k-robust min-cut problem, some set S of k sources will want to be separated 
from the sink vertex tomorrow, and we want to find the best way to cut edges to minimize the total cost incurred 
(over the two days) for the worst-case A>set S. Similarly, in the k-max-min Steiner forest, we are given a metric 
space with a collection of source-sink pairs, and seek k source-sink pairs that incur the maximum (Steiner forest) 



connection cost. Although the online-based-framework [ 14 ] can be extended to give algorithms for other /c-max 



min problems, it does not yield approximation guarantees better than the (deterministic) online competitive ratios. 



Moreover, for A; -robust problems other than set cover, the LP-rounding framework in [ |14| ] does not extend directly; 



this obstacle was also observed by Khandekar et al. [27] who studied A; -robust Steiner tree and facility location. 
1.1 Main Results 

In this paper, we present a general template to design algorithms for A; -robust and A;-max-min problems. We go 
beyond the online-based approach and obtain tighter approximation ratios that nearly match the offline guarantees; 
see the table below. We improve on previous results, by obtaining an 0(logm + logn) factor for A>robust set 
cover, and improving the constant in the approximation factor for Steiner tree. We also give the first algorithms for 
some other standard covering problems, getting constant-factor approximations for both A>robust Steiner forest — 
which was left open by Khandekar et al. — and for fc-robust min-cut, and an 0( ^[ ™ n ) approximation for A>robust 
multicut. Our algorithms do not use a max-min subroutine directly: however, our approach ends up giving us 
approximation algorithms for A>max-min versions of set cover, Steiner forest, min-cut and multicut; all but the one 
for multicut are best possible under standard assumptions. 

An important contribution of our work is the simplicity of the algorithms, and the ideas in their analysis. The 
following is our actual algorithm for A>robust set cover. 

Suppose we "guess" that the maximum second-stage cost in the optimal solution is T. Let A C U be 
all elements for which the cheapest set covering them costs more than (3 - T/k, where (3 = 0(log m + 
log n). We build a set cover on A as our first stage. (Say this cover costs Ct-) 

To remove the guessing, try all values of T and choose the solution that incurs the least total cost Ct + A/3T. 
Clearly, by design, no matter which k elements arrive tomorrow, it will not cost us more than A • k • f3T/k = X(3T 
to cover them, which is within (3 of what the optimal solution pays. This guess-and- verify framework is formalized 



in Sections 2.1 and 2.2 



The key step of our analysis is to argue why Ct is close to optimum. We briefly describe the intuition; details 



appear in [Section 3 . Suppose Ct 3> /30pt: then the fractional solution to the LP for set cover for A would 



cost S> y^Opt > Opt, and so would its dual. Our key technical contribution is to show how to "round" this dual 
LP to find a "witness" A' C A with only k elements, and also a corresponding feasible dual of value 3> Opt — i.e., 
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the dual value does not decrease much in the rounding. This step uses the fact that each element in A is expensive 
to cover individually. Using duality again, this proves that the optimal LP value, and hence the optimal set cover 
for these k elements A', would cost much more than Opt — a contradiction! 

In fact, our algorithms for the other fc-robust problems are almost identical to this one; indeed, the only slightly 
involved algorithm is that for A; -robust Steiner forest. Of course, the proofs to bound the cost Ct need different 
ideas in each case. These involve establishing certain net-type properties for the respective covering problems 
(which imply the existence of such a witness A' C A of size fc), and represent our main technical contribution. 
The proofs for set cover, min-cut and multicut are based on dual-rounding. In the case of Steiner forest, directly 
rounding the dual is difficult, and we give a primal-dual argument. 

For the cut-problems, one has to deal with additional issues because Opt consists of two stages that have to be 
charged to separately, and this requires a careful Gomory-Hu-tree-based charging. Even after this, we have to 
show the following net-type property: if the cut for a set of sources A is large (costs 3> Opt) and each source in 
A has a high individual cut (3> Opt/ k) then there is a witness A' C A of at most k sources for which the cut is 
also large (S> Opt). To this end, we prove new flow-aggregation lemmas for single-sink flows using Steiner-tree- 



packing results, and for multiflows using oblivious routing [30]; both proofs are possibly of independent interest 



To get a quick overview of our basic approach, see the analysis for Steiner tree in Appendix A . While the result 
is simple and does not require rounding the dual, it is a nice example of our framework in action. In Section 2 we 
present the formal framework for fc-robust and fc-max-min problems, and abstract out the properties that we'd like 
from our algorithms. Then Section 3[ contains such an algorithm for fc-robust set cover — Min-cut, Steiner forest 
and multicut appear in [Sections 4] , |6] and |5| The table below summarizes the best-known approximation ratios for 
various covering problems in the offline, /c-robust and online models (results denoted * are in the present paper). 



Problem 


Offline 


fc-robust 


Deterministic Online 


Set Cover 


Inn 

(l-o(l))lnn[jl| 


0(log m + log n) (*) 


0(logm ■ logn) [p] 

O / logm logn \ 


i 




I log log m+log log n J 


Steiner Tree 


1.39 [6] 


4.5 (*) 


6 (logn) [M] 


Steiner Forest 


2 [1 19] 


10 (*) 


e(logn) |5 1 


Minimum Cut 


1 


17 (*) 


OQog^n ■ log logn) |4 


24] 


Multicut 


O(logn) [0 


of, 10 ! 2 ™ ) (*) 

^ log log n J v > 


0(log 3 n ■ log logn) [| 


m 



1.2 Related Work 



Approximation algorithms for robust optimization was initiated by Dhamdhere et al. Ql 1[ ] : they study the case when 
the scenarios were explicitly listed, and gave constant-factor approximations for Steiner tree and facility location, 
and logarithmic approximations to mincut/multicut problems. Golovin et al. [ ^0| ] improved the mincut result to 
a constant factor approximation, and also gave an 0(l)-approximation for robust shortest-paths. The algorithms 



in Q20| ] were also "thresholded algorithms" and the algorithms in this paper can be seen as natural extensions of 
that idea to more complex uncertainty sets and larger class of problems (the uncertainty set in [|(]] only contained 
singleton demands). 



The fc-robust model was introduced in Feige et al. [14], where they gave an 0(logmlogn)-approximation for set 
cover; here m and n are the number of sets and elements in the set system. To get such an algorithm [|14|] first 
gave an 0(logm logn) -approximation algorithm for fc-max-min set-cover problem using the online algorithm for 
set cover [ph. They then used the fc-max-min problem as a separation oracle in an LP-rounding-based algorithm 
(a la jj33"l]) to get the same approximation guarantee for the fc-robust problem. They also showed an Q( } logTra 



* log log in > 



hardness of approximation for fc-max-min and fc-robust set cover. Khandekar et al. [ p7| ] noted that the LP-based 
techniques of [ [LH ] did not give good results for Steiner tree, and developed new combinatorial constant-factor 
approximations for fc-robust versions of Steiner tree, Steiner forest on trees and facility location. Using our frame- 
work, the algorithm we get for Steiner tree can be viewed as a rephrasing of their algorithm — our proof is arguably 
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more transparent and results in a better bound. Our approach can also be used to get a slightly better ratio than [27] 
for the Steiner forest problem on trees. 

Constrained submodular maximization problems [^, 15, |35|, [7|, [?7|] appear very relevant at first sight: e.g., the 
fc-max-min version of min-cut ("find the k sources whose separation from the sink costs the most") is precisely 
submodular maximization under a cardinality constraint, and hence is approximable to within (1 — 1/e). But apart 
from min-cut, the other problems do not give us submodular functions to maximize, and massaging the functions 



to make them submodular seems to lose logarithmic factors. E.g., one can use tree embeddings [12] to reduce 



Steiner tree to a problem on trees and make it submodular. In other cases, one can use online algorithms to get 



submodular-like properties and obtain approximation algorithms for the /c-max-min problems (as in [14]). Though 



the LP-based framework [ ]14| ] for k -robust problems does not seem to extend to problems other than set cover, in 
the companion paper [^3|] we give a general algorithm for /c-robust covering using offline and online algorithms. 
However, since our goal in this paper is to obtain approximation factors better than the online competitive ratios, it 
is unclear how to use these results. 

Considering the average instead of the worst-case performance gives rise to the well-studied model of stochastic 
optimization [|1|, Pq ]. Some common generalizations of the robust and stochastic models have been considered 
(see, e.g., Swamy [f36|] and Agrawal et al. 



To the best of our knowledge, none of the fc-max-min problems other than min-cut and set cover Q14| ] have been 
studied earlier. The k- min -min versions of covering problems (i.e. "which k demands are the cheapest to cover?") 
have been extensively studied for set cover [34, Tg], Steiner tree [17], Steiner forest [22], min-cut and multicut [21 



3C]. However these problems seem to be related to the fc-max-min versions only in spirit. 



2 Notation and Definitions 

Deterministic covering problems. A covering problem II has a ground-set E of elements with costs c : E — » 
R_i_, and n covering requirements (often called demands or clients), where the solutions to the z-th requirement is 
specified — possibly implicitly — by a family IZi C 2 E which is upwards closed (since this is a covering problem). 
Requirement i is satisfied by solution S C E iff S G TZ{. The covering problem IT = (E, c, {lZi}f =1 ) involves 
computing a solution S C E satisfying all n requirements and having minimum cost Ylees Ce ~ E-g., in set cover, 
"requirements" are items to be covered, and "elements" are sets to cover them with. In Steiner tree, requirements 
are terminals to connect to the root and elements are the edges; in multicut, requirements are terminal pairs to be 
separated, and elements are edges to be cut. 

Robust covering problems. This problem, denoted Robust(IT), is a two-stage optimization problem, where 
elements are possibly bought in the first stage (at the given cost) or the second stage (at cost A times higher). In the 
second stage, some subset u C [n] of requirements (also called a scenario) materializes, and the elements bought 
in both stages must satisfy each requirement in ui. Formally, the input to problem Robust(n) consists of (a) the 
covering problem II = (E,c, {7Zi}f =1 ) as above, (b) a set 17 C 2^ of scenarios (possibly implicitly given), and 
(c) an inflation parameter A > 1. A feasible solution to Robust(IT) is a set of first stage elements Eq C E (bought 
without knowledge of the scenario), along with an augmentation algorithm that given any wed outputs E w C E 
such that EqUE^ satisfies all requirements in co. The objective function is to minimize: c(Eq) + A-max we ^ c{E u ). 
Given such a solution, c(Eq) is called the first-stage cost and max^gj] c{E LJ ) is the second-stage cost. 

fc-robust problems. In this paper, we deal with robust covering problems under cardinality uncertainty sets: i.e., 
17 ■= (W) ={S Q [n] | \S\ = k}. We denote this problem by Robust fe (n). 

Max-min problems. Given a covering problem II and a set 17 of scenarios, the max-min problem involves finding 
a scenario uj G 17 for which the cost of the min-cost solution to u is maximized. Note that by setting A = 1 in 
any robust covering problem, the optimal value of the robust problem equals that of its corresponding max-min 
problem. In a £>max-min problem we have 17 = (^). 
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2.1 The Abstract Properties we want from our Algorithms 



Our algorithms for robust and max-min versions of covering problems are based on the following guarantee. 

Definition 2.1 An algorithm is (a±, a 2 , /3) -discriminating iff given as input any instance of Robust ( t(IT) and a 
threshold T, the algorithm outputs (i) a set &t ^ E, and (ii) an algorithm Augment^ : — > 2 E , such that: 

A. For every scenario D G (^). 

(i) the elements in &t U Augment r (Z?) satisfy all requirements in D, and 

(ii) the resulting augmentation cost c (Augment T (D)) < /3 ■ T. 

B. Let <I>* and T* (respectively) denote the first-stage and second-stage cost of an optimal solution to the 
Robustfc(n) instance. If the threshold T > T* then the first stage cost c(&t) < «i • <I>* + «2 • T*. 

The next lemma shows why having a discriminating algorithm is sufficient to solve the robust problem. The issue 
to address is that having guessed T for the optimal second stage cost, we have no direct way of verifying the 
correctness of that guess — hence we choose the best among all possible values of T. For T T* the guarantees 



in Definition 2.1 ensure that we pay « <&* + T* in the first stage, and « AT* in the second stage; for guesses 



T <^T*, the first-stage cost in guarantee (2) is likely to be large compared to Opt. 

Lemma 2.2 If there is an (ai, a 2 , ^-discriminating algorithm for a robust covering problem Robustfe(n), then 
for every e > there is a ((1 + e) • max {ai, j3 + ^-^-approximation algorithm for Robust/ (II). 

Proof: Let A denote an algorithm for Robustjt(Il) such that it is (ai, 02, /3) -discriminating. Let ground-set E = 
[m], and c max := max ee r m i c e . By scaling, we may assume WLOG that all costs in the Robustfc(II) instance 
are integral. Let e > be any value as given by the lemma (where - is polynomially bounded), and N := 

[log 1+e (m c max )~\ + 1; note that N is polynomial in the input size. Define T := | (1 + e) 1 \ < i < n\. 

The approximation algorithm for Robust/;. (II) runs the (cxi, 02, /^-discriminating algorithm A for every choice of 
T G T(here |T| is polynomially bounded), and returns the solution corresponding to: 

f := argmin{c($ T ) + A • f3T \ T G T}. 

Recall that T* denotes the optimal second-stage cost, clearly T* < m ■ c max . Let i* G Z + be chosen such that 
(1 + ef _1 <T*^< (1 + ef* ; also let V := (1 + ef (note that V G T). The objective value of the solution from 
A for threshold T can be bounded as follows. 

c(®f) + A • max c(Augmenty(w)) < c(®f) + A • /3 f 

< (ai • + a 2 ■ T*) + (1 + e) /?A ■ T* 

< (1 + e) • [ Ql ■ + (p + ^) ■ AT* 



The first inequality follows from Property A(ii) in Definition 2.1; the second by the choice of T; the third by 



Property B (applied with threshold T > T*) in pefinition 2T] , and using T' < (1 + e) • T*. Thus this algorithm 
for Robustfc(II) outputs a solution that is a ((1 + e) • max {a>i, f3 + ^ }) -approximation. ■ 

In the rest of the paper, we focus on providing discriminating algorithms for suitable values of a±, /3. 
2.2 Additional Property Needed for fc-max-min Approximations 

As we noted above, a /c-max-min problem is a fc-robust problem where the inflation A = 1 (which implies that in 
an optimal solution <£* = 0, and T* is the fc-max-min value). Hence a discriminating algorithm immediately gives 
an approximation to the value: for any D G (^), U Augment T (rJ) satisfies all demands in D, and for the right 
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guess of T fa T*, the cost is at most (a 2 + 0)T*. It remains to output a bad fe-set as well, and hence the following 
definition is useful. 



Definition 2.3 An algorithm for a robust problem is strongly discriminating if it satisfies the properties in Def- 
inition |Zi[ and when the inflation parameter is X = 1 (and hence <£* = 0), the algorithm also outputs a set 
Qt G ( ) such that if c(<3?t) > a^T". fAe corf of optimally covering the set Qt is > T. 

Recall that for a covering problem II, the cost of optimally covering the set of requirements Q G (^') is Opt(Q) = 
mm{c(E Q ) \E Q CE and E Q G 7^ Vi € Q}. 

Lemma 2.4 If there is an (a\ , a 2 , fi)-strongly-discriminating algorithm for a robust covering problem Robust^. (II), 
then for every e > ?/zere ijr ara algorithm for k-max-min(U) that outputs a set Q such that for some T, the optimal 
cost of covering this set Q is at least T, but every k-set can be covered with cost at most (1 + e) • (02 + /?) T. 



Proof: The approximation algorithm for MaxMin(II) is similar to that in Lemma 2.2. Let A denote an algorithm 
for the robust problem that is (ai, a 2 , P) strongly discriminating. Recall that the A;-max-min instance corresponds 
to the Robustfc(II) instance with A = 1, and hence we will run algorithm A on this robust instance. Also from 



Definition 2.\\ T* denotes the optimal second-stage cost of Robustfe(II), and its optimal fist-stage cost <£* = 



(since A = 1). Note that the optimal value of the /c-max-min instance also equals T*. 

Let ground-set E = [m], and c max := max eg [ m j c e . By scaling, we may assume WLOG that all costs in the 
instance are integral. Let e > be any value as given by the lemma (where j is polynomially bounded), and 
N := [~log 1+(E (m c rnax )~\ + 1; note that N is polynomial in the input size. Consider the integral powers of (1 + e), 



T := {tij-lo, where t t = (1 + e) 1 for i = 0, 1, • • • , N. 

The approximation algorithm for MaxMin(II) runs the strongly discriminating algorithm A for every choice of 
T G T, and let p G {1, • • • , N} be the smallest index such that c(&(t p )) < 012 t p . Observe that there must exist 



such an index since for all T > T* , we have c(§t) < «2 T* < a 2 T (property B in Definition 2.1 , using = 0), 
and clearly T* < m ■ c max < t^. The algorithm then outputs Q(t p ^i) as the max-min scenario. Below we prove 
that it achieves the claimed approximation. We have for all T > 0, 

T* = max jopt(L>) : D G ftjjj j < max jc($ r ) + c(Augment T (D)) : D G j < c($ T ) + P T. 



Above, the inequalities are by conditions A(i) and A(ii) of Definition 2.1. Setting T = t p here, and by choice of p, 



T* <c(<S>{t p )) + pt p <(a 2 +P)t p . 



Hence t p is a (a 2 + /^-approximation to the max-min value T*. Now applying the condition of [Definition 2.3 
with T = tp-i, since c($(£p_i)) > a 2 t v -\ (by choice of index p), we obtain that the minimum cost to cover 
requirements Q(t p -i) is at least: 

t p T* 
h ~ l ~ 1 + e - (l + e)-(Q 2 + /3)' 
which implies the desired approximation guarantee. ■ 

3 /c-Robust Set Cover 

Consider the A;-robust set cover problem where there is a set system (U, F) with a universe of \U\ = n elements, 
and m sets in F with each set R G F costing cr, an inflation parameter A, and an integer k such that each of 
the sets (^) is a possible scenario for the second-stage. Given Lemma 2.2 , it suffices to show a discriminating 



algorithm as defined in Definition 2.1 for this problem. The algorithm given below is easy: pick all elements which 



can only be covered by expensive sets, and cover them in the first stage. 



Algorithm 1 Algorithm for /c-Robust Set Cover 



1: input: -robust set-cover instance and threshold T. 

2: let /3 <— 36 In m, and S <— {v G U | min cost set covering v has cost at least f3 ■ ? }. 

3: output first stage solution <I>r as the Greedy-Set-CoverCS 1 ). 

4: define Augmented?}) as the min-cost set covering i, for i £ U \ S; and Augment T ({i}) = for i G S. 

5: output second stage solution Augment^ where Augment T (Z?) := [j ieD Augment^di}) for all D C {/. 



Claim 3.1 (Property A for Set Cover) For allT > awe? scenario D G (^), f/ze seta <3?y U Augment-^ (-D) cover 
elements in D, and have cost c(Augment r (-D)) < (3T. 

Proof: The elements in D n S are covered by <J>r; and by definition of Augment^, each element i G D \ S is 
covered by set Augment T ({i}). Thus we have the first part of the claim. For the second part, note that by definition 
of S, the cost of Augment T ({i}) is at most pT/k for all i G U . ■ 

Below H n := Ya=i i ~ Inn; recall that the greedy algorithm for set cover is an ^-approximation where n is 
the number of elements in the given instance. 

Theorem 3.2 (Property B for Set Cover) Let $>* denote the optimal first stage solution (and its cost), and T* the 
optimal second stage cost. Let (3 = 36 In m. If T > T* then c($t) < H n ■ (<&* + 12 • T*). 

Proof: We claim that there is a fractional solution x for the set covering instance S with small cost (9($* + 
T*), whence rounding this to an integer solution implies the theorem. For a contradiction, assume not: let every 
fractional set cover be expensive, and hence there must be a dual solution of large value. We then round this dual 
solution to get a dual solution to a sub-instance with only k elements that costs > <£* + T* , which is impossible 
(since using the optimal solution we can solve every instance on k elements with that cost). 

To this end, let S' C S denote the elements that are not covered by the optimal first stage <&*, and let F' C F denote 
the sets that contain at least one element from S' . By the choice of S, all sets in T' cost at least (3 ■ ? > /3 ■ 
Define the "coarse" cost for a set R G T' to be cr = \ 6 £?/ k ~\ ■ For each set R G J 7 ', since cr > > ^jp, it 
follows that cr ■ G [cr, 2 • cr), and also that cr > /3/6. 

Now consider the following primal-dual pair of LPs for the set cover instance with elements S' and sets F' having 
the coarse costs c. 



min Erg.f' ' X R max E eS 5' Ve 

Ej? 3e X R > h Ve G 5', EeGfl^e < cr, yR G J 7 ', 

x R > 0, Vi? G T' . y e > 0, Ve G S' . 

Let {x r\ r^jti be an optimal primal and {y e } e eS' an optimal dual solution. The following claim bounds the 
(coarse) cost of these fractional solutions. 

Claim 3.3 If f3 = 36 In m, then the LP cost is YIr^j 7 ' cr ' X R = Sees' Ve — 2 • k. 



Before we prove Claim 3.3 , let us assume it and complete the proof of [Theorem 3.2 . Given the primal LP solution 



{xr^rizjti to cover elements in 5', define an LP solution to cover elements in S as follows: define zr = 1 if 
R G <&*, zr = xr if R G T' \ $*; and zr = otherwise. Since the solution z contains <£* integrally, it covers 
elements S \ S' (i.e. the portion of S covered by <3?*); since zr > xr, z fractionally covers S'. Finally, the cost of 
this solution is Y^r crzr < + Y^r crxr < + ^ • YrCR x R- But |Claim 3~3| bounds this by $* + 12 • T*. 
Since we have a LP solution of value $* + 12T*, and the greedy algorithm is an ^-approximation relative to the 
LP value for set cover, this completes the proof. ■ 



Claim 3.1 and [Theorem 3.2 show our algorithm for set cover to be an (H n , 12if n ,361nm)-discriminating algo- 



rithm. Applying Lemma 2.2 converts this discriminating algorithm to an algorithm for A; -robust set cover, and gives 



the following improvement to the result of [|TJ|]. 
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Theorem 3.4 There is an 0(log m + log n)-approximation for k-robust set cover. 



It remains to give the proof for |Claim 3.3| above; indeed, that is where the technical heart of the result lies. 

Proof of |Claim 3.3| : Recall that we want to bound the optimal fractional set cover cost for the instance (£", J 7 ') 
with the coarse (integer) costs; xr and y e are the optimal primal and dual solutions. For a contradiction, assume 
that the LP cost YIrpt' ^R x R = /CeeS' D e ^ es * n tne urut i nterva l ((7 ~~ 7&] f° r some integer 7 > 3. 
Define integer-valued random variables {Y e } e€ s' by setting, for each e G 5' independently, Y e = [y e \ + I e , 
where I e is a Bernoulli(y e — [y e \) random variable. We claim that whp the random variables Y e /3 form a feasible 
dual — i.e., they satisfy all the constraints {^eeiiO'e/^) — ^r}r&T' w hh high probability. Indeed, consider a 
dual constraint corresponding to R G T'\ since we have Yle£RlVe\ — CR> we § et that Pr[^ e6fl y e > 3 • cr] < 
P r Eeei? > 2 • cr]. But now we use a Chernoff bound [ ^8| ] to bound the probability that the sum of independent 
0-1 r.v.s, £ 

e gR -^e> exceeds twice its mean (here X/eeR^I^e] — X^ee-R^A 2 — hy e CR I^ < e ^/^^ < m 2 , 
since each cr > /3/6 and /3 = 36 • mm. Finally, a trivial union bound implies that Y e /3 satisfies all the m 
contraints with probability at least 1 — 1/m. Moreover, the expected dual objective is J2 e eS' Ve > (l — l)k > 1 
(since 7 > 3 and k > 1), and by another Chernoff Bound, P^[^2 ee s' Y e > ■ k] > a, where a > is some 
constant. Putting it all together, with probability at least a — ^, we have a feasible dual solution Y' e := Y e /3 with 
objective value at least 2^- • k. 

Why is this dual Y' e any better than the original dual y e ? It is "near-integral" — specifically, each Y' e is either zero 
or at least |. So order the elements of 5' in decreasing order of their y'-value, and let Q be the set of theirs? k 
elements in this order. The total dual value of elements in Q is at least minl^^pfc, |} > |, since 7 > 3, and each 
non-zero Y' value is > 1/3. This valid dual for elements in Q shows a lower bound of | on minimum (fractional) 
c-cost to cover the k elements in Q. Using cr > • cr for each R G J 7 ', the minimum c-cost to fractionally 
cover Q is > • | = T*. Hence, if Q is the realized scenario, the optimal second stage cost will be > T* (as no 
element in Q is covered by $*) — this contradicts the fact that OPT can cover Q G (^) with cost at most T*. Thus 
we must have 7 < 2, which completes the proof of |Claim 33 . ■ 



The fc-Max-Min Set Cover Problem. The proof of Claim 3.3 suggests how to get a [H n , 12H n , 36 mm 



strongly discriminating algorithm. When A = 1 (and so <&* = 0), the proof shows that if c(3>t) > 12if n • T, 
there is a randomized algorithm that outputs fc-set Q with optimal covering cost > T (witnessed by the dual solu- 
tion having cost > T). Now using ^emma 2A , we get the claimed 0(log m + log n) algorithm for the /c-max-min 



set cover problem. This nearly matches the hardness of O( lo °^ m + log n) given by [14] 



Remarks: The result above (as well as the [14] result) also hold in the presence of set-dependent inflation 



factors — details appear in Appendix B. Results for the other covering problems do not extend to the case of non- 



uniform inflation: this is usually inherent, and not just a flaw in our analysis. Eg., Q27[ ] give an ^(log 1 / 2 ^ n) hard 
ness for fc-robust Steiner forest under just two distinct inflation-factors, whereas we give an 0(l)-approximation 



under uniform inflations (in Section 6). 



4 &>Robust Minimum Cut 



We now consider the fc-robust minimum cut problem, where we are given an undirected graph G = (V, E) with 
edge capacities c : E — > R + , a root r G V, terminals U C V, inflation factor A. Again, any subset in (, ) is a 
possible second-stage scenario, and again we seek to give a discriminating algorithm. This algorithm, like for set 
cover, is non-adaptive: we just pick all the "expensive" terminals and cut them in the first stage. 

Claim 4.1 (Property A for Min-Cut) For all T > and D G (y , the edges $r U Augment T (-D) separate the 
terminals D from r; moreover, the cost c(Augment T (D)) < /3T. 

Theorem 4.2 (Property B for Min-Cut) Let denote the optimal first stage solution (and its cost), and T* the 
optimal second stage cost. If f3 > ^ and T >T* then c($t) < 3 • + | • T*. 
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Algorithm 2 Algorithm for /c-Robust Min-Cut 



1: input: -robust minimum-cut instance and threshold T. 

2: let /3 <— 0(1), and S <— {v G [7 | min cut separating i> from root r has cost at least (3 ■ -jr}. 

3: output first stage solution as the minimum cut separating S from r. 

4: define Augment T ({i}) as the min-r-i cut in G \ for i <E U \ S; and Augment T ({i}) = for i G S 1 . 

5: output second stage solution Augment^ where Augment T (Z?) := [j ieD Augment r ({i}) for all D C [/. 



Here's the intuition for this theorem: As in the set cover proof, we claim that if the optimal cost of separating S 
from the root r is high, then there must be a dual solution (which prescribes flows from vertices in S to r) of large 
value. We again "round" this dual solution by aggregating these flows to get a set of k terminals that have a large 
combined flow (of value > <£* + T*) to the root — but this is impossible, since the optimal solution promises us a 
cut of at most <£* + T* for any set of k terminals. 

However, more work is required. For set-cover, each element was either covered by the first-stage, or it was not; 
for cut problems, things are not so cut-and-dried, since both stages may help in severing a terminal from the root! 
So we divide S into two parts differently: the first part contains those nodes whose min-cut in G is large (since 
they belonged to S) but it fell by a constant factor in the graph G \ These we call "low" nodes, and we 
use a Gomory-Hu tree based analysis to show that all low nodes can be completely separated from r by paying 
only 0(&*) more (this we show in |Claim 4.3 ). The remaining "high" nodes continue to have a large min-cut in 



G \ <&*, and for these we use the dual rounding idea sketched above to show a min-cut of 0(T*) (this is proved in 



Claim 4.4). Together these claims imply Theorem 4.2 



To begin the proof of Theorem 4.2, let H := G \ <&*, and let 5/, C 5 denote the "high" vertices whose min-cut 

2 ' fc 



from the root in H is at least M := % ■ The following claim is essentially from Golovin et al. [M. 



Claim 4.3 (Cutting Low Nodes) IfT > T*, the minimum cut in H separating S \ S^from r costs at most 2 • <£*. 

Proof: Let S' := S \ S h , and t := (3 ■ For every v G S', the minimum r - v cut is at least /3 • j > /3 ■ ^ = 2M 
in G, and at most M in H. Consider the Gomory-Hu (cut-equivalent) tree T{H) on graph H rooted at r [32, 
Chap. 15]. For each u G S' let D u C V denote the minimum r — u cut in T(H) where u G D u and r D u . 
Pick a subset S" C S' of terminals such that the union of their respective min-cuts in T(H) separate all of S' from 
the root and their corresponding sets D u are disjoint (the set of cuts in tree T(H) closest to the root r gives such 
a collection). It follows that (a) {D u | u G S"} are disjoint, and (b) F := U u ^s"dH{D u ) is a feasible cut in H 
separating S' from r. Note that for all u G S", we have c(dH(D u )) < M (since it is a minimum r-u cut in H), and 
c{d G (D u )) > 2M (it is a feasible r-u cut in G). Thus c(d H (D u )) < c(d G (D u )) - c(d H (D u )) = c(d^*{D u )). 
Now, c(F) < J2ueS" c (<9h(-D«)) < S«eS" c (<9$* (D u )) < 2 • <&*. The last inequality uses disjointness of 
{D u } u£ S"- Thus the minimum r — S' cut in H is at most 2$*. ■ 

Claim 4.4 (Cutting High Nodes) IfT >T*, the minimum r-Sh cut in H costs at most f • T*, when (3 > i^f . 

Proof: Consider an r-Sh max-flow in the graph H = G \ <!>*, and suppose it sends «j • M flow to vertex i G S^. 
By making copies of terminals, we can assume each «j G (0, 1]; the k -robust min-cut problem remains unchanged 
under making copies. Hence if we show that J2i^s h ai — ^> tne tota ^ ^ ow (which equals the min r-Sh cu would 
be at most k ■ M = ^ ■ T*, which would prove the claim. For a contradiction, we suppose that J2 i&Sh on > k. 
We will now claim that there exists a subset W C with \ W\ < k such that the min r-W cut is more than T*, 
contradicting the fact that every fc-set in H can be separated from r by a cut of value at most T*. To find this set 
W, the following redistribution lemma (proved at the end of this theorem) is useful. 

Lemma 4.5 (Redistribution Lemma) Let N = (V,E) be a capacitated undirected graph. Let X C V be a set of 

terminals such min-cut n (i , j) > lfor all nodes i,j G X. For each i G X, we are given a value G (0, 1]. Then 
for any integer I < ^2 i€X €i > ^ iere exists a subset W C X with \ W\ < I vertices, and a feasible flow f in N from 
XtoWso that ( i) the total f-flow into W is at least — | — • i and ( ii) the f-flow out of each i G X is at most / 4. 
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We apply this lemma to H = G \ <E>* with terminal set Sh, but with capacities scaled down by M. Since for any 
cut separating x, y G Sh, the root r lies on one side on this cut (say on y's side), min-cut#(x, y) > M — hence the 
scaled-down capacities satisfy the conditions of the lemma. Now set £ = k, and := ctj for each terminal i G Sh', 
by the assumption YlieS e * = ^2ies h &i > k = £. Hence ^emma 43 finds a subset W C Sh with k vertices, and 



a flow / in (unsealed) graph H such that / sends a total of at least 1 ~| |/e • kM units into W, and at most ^ • M 
units out of each i G Also, there is a feasible flow g in the network H that simultaneously sends • M flow 
from the root to each i G Sh, namely the max-flow from r to Sh- Hence the flow is feasible in H, and sends 
at least | • X ~ X J e ■ kM = 1 ~^ //e • kM units from r into W. Finally, if (3 > ^ff , we obtain that the min-cut in H 
separating W from r is greater than T*: since \W\ < k, this is a contradiction to the assumption that any set with 
at most k vertices can separated from the root in H at cost at most T* . ■ 



From Claim 4. 1 and [Theorem 4.2 , we obtain a (3, §, 0) -discriminating algorithm for A; -robust minimum cut, when 



P > ^ZTj"- We set (3 = and use Lemma 2.2 to infer that the approximation ratio of this algorithm is max{3, ^ + 
/?} = ^ + /3. Since picking edges only in the second-stage is a trivial A-approximation, the better of the two gives 
an approximation of min{^ + /3, A} < 17. Thus we have, 

Theorem 4.6 (Min-cut Theorem) There is a 17 -approximation algorithm for k-robust minimum cut. 

It now remains to prove the redistribution lemma. At a high level, the proof shows that if we add each vertex i G X 
to a set W independently with probability ei ij (£V e^), then this set W will (almost) satisfy the conditions of the 
lemma whp. A natural approach to prove this would be to invoke Gale/Hoffman-type theorems [|32|, Chap. 11]: 
e.g., it is necessary and sufficient to show that c{dV) > |demand(y) — supply (V')\ for all V C V for this 
random choice W. But we need to prove such facts for all subsets, and all we know about the network is that 
the min-cut between any pair of nodes in X is at least 1! Also, such a general approach is likely to fail, since the 
redistribution lemma is false for directed graphs (see remark at the end of this section) whereas the Gale-Hoffman 
theorems hold for digraphs. In our proof, we use undirectedness to fractionally pack Steiner trees into the graph, 
on which we can do a randomized-rounding-based analysis. 



Proof of Lemma 4.5 (Redistribution Lemma): To begin, we assume w.l.o.g. that the bounds = 1/P for all 
i 6 I for some integer P. Indeed, let P G N be large enough so that ii = €{P is an integer for each i £ I. 
Add, for each i G X, a star with e« — 1 leaves centered at the original vertex i, set all these new vertices to also 
be terminals, and let all new edges have unit capacity. Set the new e's to be 1/P for all terminals. To avoid excess 
notation, call this graph N as well; note that the assumptions of the lemma continue to hold, and any solution W 
on this new graph can be mapped back to the original graph. 

Let c e denote the edge capacities in N, and recall the assumption that every cut in N separating X has capacity 
at least one. Since the natural LP relaxation for Steiner-tree has integrality gap of 2, this implies the existence 
of Steiner trees {T a } ae A on the terminal set X that fractionally pack into the edge capacities c. I.e., there exist 
positive multipliers {X a } a eA such that J2 a = h an( * S fl ^« ' x(T a ) < c, where x(T a ) is the characteristic 
vector of the tree T a . Choose W C X by taking £ samples uniformly at random (with replacement) from X. We 
will construct the flow / from I to If as a sum of flows on these Steiner trees. In the following, let q := \X\; note 
that£ < \X\e = q/P. 

Consider any fixed tree T a in this collection, where we think of the edges as having unit capacities. We claim that 
in expectation, units of flow can be feasibly routed from X to W in T a such that each terminal supplies at 
most £/q. Indeed, let r a denote an oriented Euler tour corresponding to T a . Since the tour uses any tree edge twice, 
any feasible flow routed in r a (with unit-capacity edges) can be scaled by half to obtain a feasible flow in T a . We 
call a vertex v G X a-close if there is some W- vertex located at most q/£ hops from v on the (oriented) tour r a . 
Construct a flow f a on r a by sending £/q flow from each a-close vertex v G X to its nearest VF-vertex along r a . 
By the definition of a-closeness, the maximum number of flow paths in f a that traverse an edge on r a is q/£; since 
each flow path carries £/q flow, the flow on any edge in r a is at most one, and hence f a is always feasible. 
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For any vertex v G X and a tour r a , the probability that v is not a-close is at most (1 — < e 1 ; hence 

v G X sends flow in /„ with probability at least 1 — e . Thus the expected amount of flow sent in f a is at least 
(1 — e _1 )|X| • (£/q) = (1 — e^ 1 ) • I. Now define the flow f ■= n A a • fa by combining all the flows along 
all the Steiner trees. It is easily checked that this is a feasible flow in N with probability one. Since J2 a = \, 
the expected value of flow / is at least 1 ~^ |/e ^. Finally the amount of flow in / sent out of any terminal is at most 
\ • £/q < jp- This completes the proof of the redistribution lemma. ■ 



The A>max-min Min-Cut Problem. When A = 1 and $* = 0, the proof of Theorem 4.2 gives a randomized 



algorithm such that if the minimum r-S cut is greater than it finds a subset W of at most k terminals such 
that separating W from the root costs more than T (witnessed by the dual value). Using this we get a randomized 
(3, % , 0) strongly discriminating algorithm, and hence a randomized 0(l)-approximation algorithm for /c-max-min 



min cut from Lemma 2.4. We note that for fc-max-min min-cut, a (1 — 1/e) -approximation algorithm was already 
known (even for directed graphs) via submodular maximization. However the above approach has the advantage 
that it also extends to fc-robust min-cut. 

Bad Example for Directed Graphs. Let us show that our theorems for -robust min-cut have to use the undirect- 
edness of the graph crucially, and that the theorems are in fact false for directed graphs. Consider the digraph G 
with a root r, a "center" vertex c, and I terminals vi, V2, ■ ■ ■ , ve- This graph has arcs are (c, r), {(r, Vi)} ie w and 
{(vi, c)}j S m; each having unit capacity. Note that the min-cut between every Vi-vj pair is 1, but if we give each of 
the ViS €i = flow, there is no way to choose yl of these vertices and collect a total of f2(v^) flow at these 
"leaders". This shows that the redistribution lemma ( [Lemma 4~3 ) is false for digraphs. 



A similar example shows that that thresholded algorithms perform poorly for -robust directed min-cut, even for 
k = 1. Consider graph D with vertices r, c and {fi}j e m as above. Graph D has unit capacity arcs {{vi,r)} i£ w, 
and VI capacity arcs (c, r) and {(vi,c)} ie ^. The inflation factor is A = \fl. The optimal strategy is to delete 
the arc (c, r) in the first stage. Since k = 1, one of the terminals Vi demands to be separated from the root in the 
second stage, whence deleting the edge (vi, r) costs A • 1 = y/l resulting in a total cost of 2y/l. However, any 
threshold-based algorithm would either choose none of the terminals (resulting in a recourse cost of = £), or 
all of them (resulting in a first-stage cost of at least I). 

5 fc-Robust Multicut 

We now consider the multicut problem: we are given an undirected graph G = (V, E) with edge-costs c : E — ^ IR + , 
and m vertex-pairs In the k -robust version, we are also given an inflation parameter A and bound k 

on the cardinality of the realized demand-set. Let denote the optimal first stage solution (and its cost), and T* 
the optimal second stage cost; so Opt = <1>* + A • T*. The algorithm (given below) is essentially the same as for 
minimum cut, however the analysis requires different arguments. 

Algorithm 3 Algorithm for /c-Robust MultiCut 
1: input: fc-robust multicut instance and threshold T. 

2: let p := O(logra) be the approximation factor in Racke's oblivious routing scheme [[3C|], e G (0, \) any 

constant, and/5:= p- 
3: let S <— {i G [m] \ min Sj-ij cut has cost at least j3 ■ ?}. 



output first stage solution &t as the 0(log n)-approximate multicut [|TJ] for S. 
define Augment T ({i}) as edges in the min Sj — tj cut, for i G [m] \ S; and Augment T ({i}) = for i G S. 
output second stage solution Augment^ where Augment T (u;) := [j ieu) Augment T ({i}) for all oj C [m]. 



Claim 5.1 (Property A for Multicut) For all T > and uj C [m], the edges <1>t U Augment T (o;) separate Sj 
and ti for all i G oj; additionally if\oj\ < k then the cost c(Augment r (^)) < f3T. 



II 



Proof: Pairs in uj n S are separated by By definition of Augment T , for each pair i G ui \ S edges 

Augment T ({i}) form an Sj — ij cut. Thus we have the first part of the claim. For the second part, note that 
by definition of S, the cost of Augment T ({i}) is at most (3T/k for all i G [m]. ■ 

Theorem 5.2 (Property B for Multicut) IfT > T* then c($ T ) < 0(log n) ■ $* + 0(log 2+e n) ■ T*. 



To prove the theorem, the high level approach is similar to that for A;-robust min-cut. We first show in Lemma 5.3 
that the subset of pairs S C S whose min-cut fell substantially on deleting the edges in 3>* can actually be 
completely separated by paying 0(1)<3?*. This is based on a careful charging argument on the Gomory-Hu tree 



and generalizes the [20] lemma from min-cut to multicut. Then in Lemma 5.6 we show that the remaining pairs 
in S \ S can be fractionally separated at cost 0(log 1+e n) T*. This uses the dual-rounding approach combined 



with Racke's oblivious routing scheme Q30[]. Finally since the [ |18| ] algorithm for multicut is relative to the LP, this 
would imply Theorem 5.2| . 



Let us begin by formally defining the cast of characters. Let H := G \ <&* and M := /3 ■ j-. Define, 

5 := [i G S | min cost Sj-ij cut in H is less than ^} 
to be the set of pairs whose mincut in G was at least M, but has fallen to at most M/4 in H = G \ <&*. 
Lemma 5.3 IfT> T*, there is a multicut separating pairs S in graph H which has cost at most 2 $*. 

Proof: We work with graph H = (V, F) with edge-costs c : F — > M. A cluster refers to any subset of vertices. 
A cut equivalent tree (c.f. JlO|]), P = (N(P), E(P)) is an edge-weighted tree on clusters N(P) = {Nj} r - =1 such 
that: 

• the clusters {Nj} r - =l form a partition of V, and 

• for any edge e G E(P), its weight in P equals the c-cost of the cut corresponding to deleting this edge in 
P. I.e., if (S e , Sg) is the partition of V obtained by unioning the vertices in the clusters belonging to the two 
connected components of P \ {e}, then e's weight in P equals c(5(S e )) = c(5(S°)). 

The Gomory-Hu tree Pqh = (V, E{Pqh)) of H is a cut-equivalent tree where the clusters are singleton vertices, 
and which has the additional property that for every u, v G V the minimum u-v cut in Pqh equals the minimum 
u-v cut in H. For any cut-equivalent tree, a cluster C V is called active if there is some i G S such that 
\N n {sj,tj}| = 1; otherwise the cluster A^ is called dead. We obtain a cut-equivalent tree Q from Pqh by 
repeatedly performing one of the following modifications: (1) for each edge having weight greater than merge 
the clusters corresponding to its end points; and (2) for each dead cluster, merge it with any of its neighboring 
clusters. Note that in the resulting tree Q, every edge in E(Q) has weight at most and every cluster in J\f(Q) 
is active. Let V := UAreAf(Q) ®h(N). In the next two claims we show that V is a feasible multicut for S with cost 
at most 2$*. 

Claim 5.4 T> is a feasible multicut separating pairs S in H. 

Proof: Clearly for each pair % G S, vertices Sj and tj are in distinct active clusters of the Gomory-Hu tree 
Pgh- Additionally there is some edge of weight less that ^ on the Sj — U path in Pgh' since the minimum 
Si — U cut in H is less than Observe that in obtaining tree Q from Pgh, we never contract two active 
clusters nor an edge of weight less that Thus Sj and t{ lie in distinct clusters of Q. Since this holds for all 
i G S, the claim follows by definition of V. ■ 

Claim 5.5 The cost c(V) = Y,e&v c e < 2 $*> 'f T > T *- 

Proof: Consider any cluster A^ G M(Q). Since all clusters in M(Q) are active, A^ contains exactly one of 
{sj, ti} for some i G S. Hence the cut da(N) (in graph G) has cost at least (3 ■ ? > j3 ■ ^ = M, by definition 
of the set S D S. 
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Let jV 2 (Q) C M(Q) denote all clusters in Q having degree at most two in Q. Note that |A/" 2 (Q)| > \\N(Q)\. 
Using the above observation and the fact that clusters in N~2{Q) are disjoint, we have 

\M 2 (Q)\M< Y, c ( d G(N))= E «d H (N)) + c(d^(N)))< Y, c(d H (N)) + 2$*. (5.1) 

N£JV 2 (Q) N£JV 2 {Q) N£Af 2 (Q) 

We now claim that for any N G M 2 {Q), the cost c(9#(./V)) < Let e\ and e 2 denote the two edges 
incident to cluster N in Q (the case of a single edge is easier). Let (Ui, V \ Ui) denote the cut corresponding 
to edge e/ (for I = 1, 2) where JV C [/;. Each of these cuts has cost c(<9# ([/;)) < ^ by property of cut- 
equivalent tree Q, and their union dn{Ui) |J dH(U 2 ) is the cut separating N from V\N. Hence it follows that 



c(d H {N)) < 2 • ^ = 4f . Using this in (O) and simplifying, we obtain \M(Q)\ M < 2 ■ \Af 2 (Q)\ M < 8 



For each edge e G E(Q), let D e C F denote the edges in graph H that go across the two components of 
Q \ {e}. By the property of cut-equivalent tree Q, we have c(D e ) < Since V = U e e£(<2) 

c(P)< ^ c ( jDe )<| jE ;(Q)|^<|AA(Q)|^<2$* 
This proves the claim. ■ 



Combining Claims 5.4 and 5.5, we obtain the lemma. 



Now we turn our attention to the remaining pairs W := S \ S, and show that there is a cheap cut separating them 
in H. For this we use a dual-rounding argument, based on Racke's oblivious routing scheme. Recall that constant 
< e < \, p = 0(log n) (Racke's approximation factor), and /3 = p • e |og°fg" n - Define a := ep • log e n. 

Lemma 5.6 There exists a fractional multicut separating pairs W in the graph H which has cost 8a ■ T*. 

Proof: For any demand vector d : W — > R + , the optimal congestion of routing d in H, denoted Cong(d), is 
the smallest r\ > such that there is a flow routing d{ units of flow between Sj and £j (for each i G W), using 
capacity at most i] • c e on each edge e € H. Note that for every i € W, the Sj-ij min-cut in H has cost at least 
L := ^ = f" Hence for any z G W, the optimal congestion for a unit demand between Si-U (and zero 
between all other pairs) is at most ^. 



Now consider Racke's oblivious routing scheme [ p0| ] as applied to graph H. This routing scheme, for each i G VF, 
prescribes a unit flow T{ between Sj-tj such that for every demand vector d : W — > M + , 

max ^ ,lGW 1 < p . Cong(d), where p = O (log n); 

i.e., the congestion achieved by using these oblivious templates to route the demand d is at most p times the best 
congestion possible for that particular demand d. 

Now consider a maximum multicommodity flow in H; suppose that it sends yi ■ j- units between Si,ti for each 
i G W. For a contradiction, suppose that J2i^w Hi > & a ' k. (Otherwise the maximum multicommodity flow, 
and hence its dual, the minimum fractional multicut is at most 8aT*, and the lemma holds.) By making copies of 
vertex-pairs, we may assume that yi G [0, 1] for all i G W; this does not change the k -robust multicut instance. 
Define a (not necessarily feasible) multicommodity flow Q := J2ieW Xi'jr"^ where each Xi is an independent 
0-1 random variable with Pr[X^ = 1] = and F L is the Racke oblivious routing template. The flow has expected 
magnitude at least ^ — tt > 8T*, and is the sum of {0, ^}-valued random variables, hence by a Chernoff 
bound: 

Claim 5.7 With constant probability, the magnitude of flow Q is at least T*. 



Claim 5.8 The flow Q is feasible with probability 1 — o(l). 
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Proof: Fix any edge e G H, and let Ui{e) := j- • Fi(e) for all i G W. Note that the random process gives 
us a flow of ^ Xi ■ Ui(e) on the edge e. The feasibility of the maximum multicommodity flow says that 
Cong({yj • jr}iew) < 1- Since oblivious routing loses only a p factor in the congestion, 

J2iew Vi ■ u i(e) = Eieiy Vi ' IT ' -^iO 5 ) - P ' Ce ' 
and the expected flow on edge e sent by the random process above is X^eW oT ' u «( e ) — a c e- 

Now, since the min Sj-ij-cut is at least L for any i G W, a unit of flow can (non-obliviously) be sent between 
Si, ti at congestion at most ^. Hence using the oblivious routing template JFi incurs a congestion at most 
Hence, 



Ui{e) = ^-Ti{e) <Tl.P.. Ce = ^P 



We divide the individual contributions by the edge capacity and further scale up by by defining new [0, 1]- 



random variables K- 



Xj-Uj(e) 



_§_ 



We get that p, := E[J2 Yi] < f-. Recall the Chernoff bound that says 



that for independent [0, 1] -valued random variables Y{, 



Pr 



\ A«(l+<5) 



Using this with p(l + 5) = (hence 5 + 1 > 2) we get 



< 



that 



1 + 5, 



Pr 



i 



• Ui(e) > c e 



Pr 



exp — e log log n 



41ogn 
e log log n 



n 1 



since a = ep • log e n and ft = p ■ 



| 61 ? gw . Now a trivial union bound over all n 2 edges gives the claim. 

e log log n ° ° 



By another union bound, it follows that there exists a feasible multicommodity flow Q that sends either zero or j- 
units for each pair i G W, and the total value of Q is at least T*. Hence there exists some fc-set W C W such 
that the maximum multicommodity flow for W' on H is at least T* . This contradicts the fact that every fc-set has 
a multicut of cost less than T* in H. Thus we must have J2iew Vi — & a ' ^> which implies Lemma 5^6[ ■ 

Combining |Lemmas 33 and 5.6, we obtain a. fractional multicut for pairs S in graph G, having cost O(l) • + 
0(log 1+<E n) • T*. Since the Garg et al. [[T|] algorithm for multicut is an 0(log n)-approximation relative to the LP, 
we obtain [Theorem 5.2 



From Claim 5.1 and Theorem 5.2 , it follows that this algorithm is O (log n, log 2+e n, 0) -discriminating for k- 
robust multicut. Since /3 = 0(log 2 nj log log n), using pemma 2T2 , we obtain an approximation ratio of: 



max < log n, 



log 2 n 
log log n 



+ 



log 



2+€ 
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This is an O 



log n 
log log n 



approximation when A > log 2e n. On the other hand, when A < log^ e n, we can use 



2e 



the trivial algorithm of buying all edges in the second stage (using the GVY algorithm [18]); this implies an 



0(log 1+2e n) -approximation. Since log 



l+2e 



n 



log n 



, we obtain: 



Theorem 5.9 There is an O 



log log n / 

log log n I -approximation algorithm for k-robust multicut. 



log n 



The fc-max-min Multicut Problem. The above ideas also lead to a (c\ ■ logn, C2 • log 2 n, cs ■ log 2 n) strongly 
discriminating algorithm for multicut, where c±, C2, C3 are large enough constants. The algorithm is exactly Algo- 
rithm 5 with parameter (3 := ©(log n) ■ p with an appropriate constant factor; recall that p = 0(log n) is the approx- 
imation ratio for oblivious routing [0C|]. [Lemma 5^6 shows that this algorithm is [c\ ■ log n, C2 ■ log 2 n, C3 • log 2 n) 
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discriminating (the parameters are only slightly different and the analysis still applies). To establish the property 
in pefinition 2.3 , consider the case A = 1 (i.e. = 0) and c($r) > (c2 log 2 n) • T. Since the Jl8| ] algorithm is 



0(log ra)-approximate relative to the LP, this implies a feasible multicommodity flow on pairs W (since $* = 
we also have W = S) of value at least (04 logn) • T for some constant C4. Then the randomized rounding (with 
oblivious routing) can be used to produce a /c-set W C W and a feasible multicommodity flow on W of value 



at least T; by weak duality it follows that the minimum multicut on W' is at least T and so Definition 2.3 holds. 



Thus by Lemma 2A we get a randomized 0(log 2 n)-approximation algorithm for fc-max-min multicut. 



All-or-Nothing Multicommodity Flow. As a possible use of the oblivious routing and randomized-rounding 
based approach, let us state a result for the all-or-nothing multicommodity flow problem studied by Chekuri et 
al. given a capacitated undirected graph G = (V, E) and source-sink pairs {si, t{} with demands di such that 
the min-cut(sj, ij) = O(log 2 n)di, one can approximate the maximum throughput to within an O(logn) factor 
without violating the edge-capacities, even with <i max > c m i n — the results of Chekuri et al. H ^] violated the 
edge-capacities in this case by an additive d max . This capacity violation in the previous all-or-nothing results is 
precisely the reason they can not be directly used in our analysis of -robust multicut. 

Summarizing Properties from Dual Rounding. The proofs for all problems considered so far (set cover, mini- 
mum cut, multicut) used certain dual rounding arguments. We now summarize the resulting properties in a self- 
contained form. 

Theorem 5.10 Consider any instance of set cover; let B E R + and k E 7L+ be values such that 

• the set of minimum cost covering any element costs at least 36 mm • and 

• the minimum cost of covering any k-subset of elements is at most B. 

Then the minimum cost of covering all elements is at most 0(log n) • B. 

Theorem 5.11 Consider any instance of minimum cut in an n-vertex undirected graph with root r and terminals 
X; let B E M + and k E Z + be values such that 

• the minimum cut separating r and u costs at least 10 • for each terminal tifl. 

• the minimum cut separating r and S costs at most B, for every k-set S E ( k ) . 

Then the minimum cut separating r and all terminals X costs at most 0(1) • B. 

Theorem 5.12 Consider any instance of multicut in an n-vertex undirected graph with source-sink pairs {sj, ti} i& \ m 
let B E and k E Z + be values such that 

• the minimum Si — tj cut costs at least c ■ log 2 n ■ -p for each pair i E [m]. 

• the minimum multicut separating pairs in P costs at most B, for every k-set P E ( ^ )• 

Then the minimum multicut separating all pairs [m] costs at most 0(log 2 n) • B. Here c is a universal constant 
that is independent of the multicut instance. 

Such properties rely crucially on the specific problem structure, and cannot hold for general covering problems — 
even for the Steiner-tree cost function on a tree metric (which, in fact, is submodular). Consider a tree on vertices 
{r, u} Ul^iliLi w i tn root r an d terminals {vi}™ =1 . The edges set contains (r, u) with cost k, and for each i E [n] 
the edge (n, v i) with cost one. For parameter B = 2k, the cost for connecting any single terminal to the root is 
k + 1 > -7^, whereas the cost for connecting any k-set of terminals is 2k = B. If a theorem like the ones above 
held, we might have hoped the cost to connect all the n terminals would be O(B); instead it is n + k > ^ • B. 
This is also the reason why the algorithms for Steiner tree and Steiner forest (which appear in the next section) are 
slightly more involved, and their proofs rely on a primal-dual argument instead of dual rounding. 
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6 /c-Robust Steiner Forest 



In A; -robust Steiner forest, we have a graph G = (V,E) with edge costs c : E — > R + , and a set U C V x V 
of potential terminal pairs; any set in (^) is a valid scenario in the second stage. For a set of pairs S C V x V, 
the graph G/5 is obtained by identifying each pair in S together; d G / s (-, •) is the distance in this "shrunk" graph. 
The algorithm is given below. This algorithm is a bit more involved than the previous ones, despite a similar 
general structure: we maintain a set of "fake" pairs Sf that may not belong to U for this case. The following 
analysis shows a constant-factor guarantee. (Without |iines 6| f7|, the algorithm is more natural, but for that we can 
currently only show an O(log n)-approximation; it seems that an 0(l)-approximation for that version would imply 
an 0(log ^-competitiveness for online greedy Steiner forest.) 



Algorithm 4 Algorithm for /c-Robust Steiner Forest 

1: input: k -robust Steiner forest instance and threshold T. 

2: let/3 <- 6(1), 7 <- 9(1) such that 7 < p/2. 

3: let S r , S f , W <- 

4: while there exists a pair (s, t) G U with d G ^ SrUSf ^(s, t) > j3 ■ ^ do 
5: letS r <^S r U{(s,t)} 

6: if d G (s,w) < 7 • ? for some w G W then Sf 4- Sf U {(s,w)} else W <- W U {s} 
7: if d G (t,w') < 7 • | for some w' G then <- 5/ U {(i,«/)} else W <- W U {t} 

8: end while 

9: output first stage solution &t to be the 2-approximate Steiner forest [Jl], |l9|] on pairs S 1 ,. along with shortest- 
paths connecting every pair in Sf. 

10: define Augment T ({z}) to be the edges on the Sj — tj shortest-path in G j (S r U S'j), for each pair % G U. 

11: output second stage solution Augment T where Augment T (S') := [j ieD Augment T ({i}) for all D C U. 



Claim 6.1 (Property A for Steiner forest) For allT > D G rl), the edges $t U Augment T (D) connect 
every pair in D, and have cost c(Augment T (Z))) < j3T. 

Proof: The first part is immediate from the definition of Augment^ and the fact that connects every pair in 
S r U Sf. The second part follows from the termination condition dc/(s r us f )( s i^i) < ft • ^ for all pairs i G U; 
this implies c(Augment T (£>)) < J2ieD c(Augment T ({i})) < J2 te D d G/(S r us f )(si,ti) < ^ • (3T. U 

Lemma 6.2 The optimal value of the Steiner forest on pairs S r is at least \W\ x ^-jP. 

Proof: Consider the primal (covering) and dual (packing) LPs corresponding to Steiner forest on S r . Note that for 
each pair % G S r , the distance dc(si,ti) > f3 ■ j- > 27 • -j-; so any ball of radius \ ■ ¥ around a vertex in 5 r may 
be used in the dual packing problem since it separates some pair in S r . Observe that W consists of only vertices 
from S r , and each time we add a vertex to W, it is at least jT/ k distant from any other vertex in W. Hence we can 
feasibly pack dual balls of radius ^ • around each W-vertex. This is a feasible dual to the Steiner forest instance 
on S r , of value | WI7/2 • T/k. The lemma now follows by weak duality. ■ 

Lemma 6.3 The number of "witnesses" \W\ is at least the number of "real" pairs \S r \, and \S r \ is at least the 
number of "fake" pairs \Sf\. 

Proof: Partition the set S r as follows: S g are the pairs where both end-points are added to W, S Q are the pairs 
where exactly one end-point is added to W, and 5& are the pairs where neither end-point is added to W. It follows 
that \S r \ = \S g \ + \S a \ + \S b \ and \W\ = 2 • \S g \ + \S Q \. 

Consider an auxiliaiy graph H = (W, E{W)) on the vertex set W which is constructed incrementally: 

• When a pair (s, t) G S g is added, vertices s, t are added to W, and edge (s, t) is added to E(W). 

• Suppose a pair (s, t) G S Q is added, where s is added to W, but t is not because it is "blocked" by w' G W. 
In this case, vertex s is added, and edge (s, w') is added to E(W). 
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• Suppose a pair (s, t) £ St, is added, where s and t are "blocked" by w and «/ respectively. In this case, no 
vertex is added, but an edge (w, w') is added to E(W). 

Claim 6.4 At any point in the algorithm ifx, y GW lie in the same component of H then dQ^g fU g r ^(x, y) = 0. 

Proof: By induction on the algorithm, and the construction of the graph H. 

• Suppose pair (s, t) £ S g is added, then the claim is immediate. H has one new connected component {s, t} 
and others are unchanged. Since (s, t) £ S r , d G ^ SfUSr ^(s, t) = and the invariant holds. 

• Suppose pair (s, t) £ S is added, with s added to W and t blocked by w' £ W. In this case, the component 
of H containing w' grows to also contain s; other components are unchanged. Furthermore (t, w') is added 
to Sf and (s, t) to S r , which implies dc/(s f us r )(. s ^ w> ) = 0- S° me invariant continues to hold. 

• Suppose pair (s,t) £ S b is added, with s and t blocked by w,w' £ W respectively. In this case, the 
components containing w and w 1 get merged; others are unchanged. Also (s, w), (t, w') are added to Sf and 
(s,t) to S r ; so d G /( SfUSr ^(w, w') = 0, and the invariant continues to hold. 

Since these are the only three cases, this proves the claim. ■ 

Claim 6.5 The auxiliary graph H does not contain a cycle when 7 < (3/2 

Proof: For a contradiction, consider the first edge (x, y) that when added to H by the process above cre- 
ates a cycle. Let (s, t) be the pair that caused this edge to be added, and consider the situation just before 
(s,t) is added to S r . Since (x,y) causes a cycle, x,y belong to the same component of H, and hence 
do / (S f\js r )( x 1 v) = by the claim above. But since x is either s or its "blocker" w, and y is either t or its 
blocker w', it follows that d G /( SfUSr )(s, t) < 2j ■ ^ < (3 ■ ^. But this contradicts the condition which would 
cause (s, t) to be chosen into S r by the algorithm. ■ 

Now for some counting. Consider graph H at the end of the algorithm: W denotes its vertices, and E its edges. 
From the construction of H, we obtain \W\ = 2 • \S g \ + |,S | and \E\ = \S g \ + \S Q \ + |5&| = |5 r |. Since H is 
acyclic, \S r \ = \E\ < \W\ - 1. Also note that \S f \ = 2 • \S b \ + \S a \ = 2 • |5 r | - \W\ < \S r \. Thus we have 
\W\ > \S r \ > \Sf\ as required in the lemma. ■ 



Theorem 6.6 (Property B for Steiner forest) Let denote the optimal first stage solution (and its cost), and T* 
the optimal second stage cost. IfT>T* then c($t) < ^2 " (^* + T*). 

Proof: Let \S r \ = ak. Using Lemma 6.3 , Lemma 6?2\ and the optimal solution, 



• a ■ T < \W\ ■ || < OPT(S r ) < + 



]SV| 
k 



T* < + T* + a-T* < $* + T* + aT (6.2) 



Thus a ■ T < ^3 • (** + T *) and OPT(S r ) < • ($* + T*). So the 2-approximate Steiner forest on S r has 
cost at most ■ ($* + T 1 *). Note that the distance between each pair in Sf is at most 7 • |; so the total length 



of shortest-paths in 5/ is at most |S*j| ■ 7 • 4 < |<SV| • 7 ■ 4r (again by Lemma 6.3). Thus the algorithm's first-stage 
cost is at most ^_ • ($* + T*) + 07 • T < ^_ • ($* + T*). 



Theorem 6.7 (Steiner Forest Main Theorem) There is a 10-approximation for k-robust Steiner forest. 



Proof: Using Claim 6.1 and Theorem 6.6, we obtain a (^5, /3) -discriminating algorithm (Definition 2.1) for 



A;-robust Steiner forest. Setting j3 = 27 and 7 := 2 + 2 • (1 — 1/A), |Lemma 2^2| implies an approximation ratio of 

-7-2' 7-2 ^^7/ ^ 1-1/A- 



max{ ^2 , + 27} < 4 + p^ym • Again the trivial algorithm that only buys edges in the second-stage achieves 



a 2A-approximation. Taking the better of the two, the approximation ratio is min{2A, 4 + < 10. 
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The /c-max-min Steiner Forest Problem. We now extend the A; -robust Steiner forest algorithm to b e (^jgj ^ ) 
strongly discriminating (when 7 = 3). As shown earlier, it is indeed discriminating. To show that pefinition 2.3| 
holds, consider the proof of Theorem 6.6 when A = 1 (so <£* = 0) and suppose c($v) > ^ 27 T. The 

algorithm to output the /c-set Q has two cases. 



1. If the number of "real" pairs \S r \ < k then Q := S r . We have: 

c($t) < 2 • OPT(Sr) + ^\S f \<2- OPT(S r ) + ^ |5 r | < 2 • OPT(S r ) + 7T. 

rC rC 

The first inequality is by definition of <I>r and since distance between each pair in Sf is at most 7 ■ ?, the 
second inequality is by f^emma 63 , and the last inequality uses | Since c($>t) > 27T, it follows 



that OPT(S r ) > 7T/2 > T. 



If \S r \ > k then the number of "witnesses" \W\ > \S r \ > k,by pemma 63 . Let Q C S r be anj A;-set of 



pairs such that for each i e Qat least one of {si,£i} is in W. By the construction of S r , we can feasibly 
pack dual balls of radius \ \ around each W -vertex, and so OPT(Q) > \Q\ ■ \\ = \T > T. 



Thus we obtain a constant-factor approximation algorithm for fc-max-min Steiner forest. 



7 Final Remarks 

In this paper, we presented a unified approach to directly solving /c-robust covering problems and fc-max-min 
problems. The results for all problems except multicut are fairly tight (and nearly match the best-possible for 
the offline versions). It would be interesting to obtain an 0(log n)-approximation for /c-robust and /c-max-min 
multicut. 

As mentioned earlier, approximating the value of any max-min problem reduces to the corresponding robust prob- 
lem, for any uncertainty set. We show in the companion paper [^3|] that there is also a relation in the reverse 
direction — for any covering problem that admits good offline and online approximation algorithms, an algorithm 
for the max-min problem implies one for the robust version. This reduction can be used to give algorithms for 



robust covering under matroid- and knapsack-type uncertainty sets [23] 
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A /c-Robust Steiner Tree 

In the /c -robust Steiner tree, we are given a graph G = (V, E) with edge costs c : E — >■ M.+, a root vertex r, and a 
set U C V of potential terminals. Any set of k terminals from U — i.e., any set in (^) — is a valid scenario in the 
second stage. Let d(-, •) be the shortest-path distance according to the edge costs. For a set S C V of terminals, 
define the distance d(v, S) := min^gs d(v, w). 

By the results in [Section 24] , a discriminating algorithm for this problem immediately gives us an algorithm for the 
robust version, and this is how we shall proceed. Here is our discriminating algorithm for A; -robust Steiner tree: it 
picks a /3T /k-net S of the terminals in U, and builds a MST on S as the first stage. 
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Algorithm 5 Algorithm for /c-Robust Steiner Tree 



1: input: instance of /c-robust Steiner tree and threshold T. 

2: let p 4- 9(1), S <- {r}. 

3: while there exists a terminal v G U with d(v, S) > f3 ■ ? do 
4: S^5U{u} 

5: end while 

6: output first-stage solution &t to be a minimum spanning tree on S. 

7: for each i £ U, define Augment r ({i}) to be the edges on a shortest-path from i to S. 

8: output second-stage solution Augment T where Augment T (D) := [j ieD Augment r ({i}) for all D C U. 



To show that the algorithm is discriminating, we need to show the two properties in [Definition 2.1| . The first 
property is almost immediate from the construction: since every point in U \ S is close to some point in the net S, 
this automatically ensures that the second stage recourse cost is small. 

Claim A.l (Property A for Steiner Tree) For allT > and D £ (i), the edges $t U Augment^!)) connect 
the terminals in D to the root r, and have cost c(Augment T (D)) < f3T. 

Proof: From the definition of the second-stage solution, Augment T (Z)) contains the edges on shortest paths from 
each D-vertex to the set S. Moreover, &t is a minimum spanning tree on S (which in turn contains the root r). 
Hence <3?t U Augment^!)) connects D to the root r. To bound the cost, note that by the termination condition in 
the while loop, every terminal % 6 U satisfies d(i,S) < {5%. Thus, 

c(Augment T ( J D)) < ^ c(Augment T ({i})) = ^ d(i, S) < ■ ft T. 

This completes the proof that the algorithm above satisfies Property A. ■ 

It now remains to show that the algorithm satisfies Property B as well. Let us show this for a sub-optimal settings 
of values; we will improve on these values subsequently. The proof is dual-based and shows that if the cost of the 
MST on S were large, then the optimal first stage solution cost <!>* must have been large as well! 



Theorem A.2 (Property B for Steiner tree) Let <!>* denote the optimal first stage solution (and its cost), and T* 

ii- 



the optimal second stage cost. IfT > T* then the first stage cost c($>t) < 2 • A • (3>* + T* 



Proof: Suppose \S\ = ak. We can divide up S into \a] sets Si, S2, ■ ■ ■ , Sr a i with at most k terminals each, and let 
E(Si) denote the second-stage edges bought by the optimal solution under scenario Si. Hence $* U (Uj< ^ E(Si)) 
is a feasible solution to the Steiner tree on S of cost at most + [a] • T*. Also, since each of the points in 5 is at 
least at distance /3T/k from each other, we get (below OPT(S) is the length of the minimum Steiner tree on S), 

f • aT = \S\ ■ § • £ < OPT(S) < + \a] ■ T* < $* + (a + 1) • T* < $* + T* + aT. 

Hence aT < -^{<5>* + T*) and OPT(S) < ■ ($* + T*); since the MST heuristic is a 2-approximation to 
the optimal Steiner tree, we get the theorem. ■ 



Combining Claim A. 1 and Theorem A.2 shows that our algorithm is a -J^j, /5) -discriminating algorithm for 



k -robust Steiner tree. Setting, say, j3 = 4 and applying Lemma 2.2 gives us an max(4, 4 + 4) = 8-approximation 



for A; -robust Steiner tree. In the next subsection, we will show how to improve this guarantee. 
A.l Improved Approximation for Steiner Tree 

In the previous analysis, we just wanted to show the main ideas and hence were somewhat sloppy with the analysis. 
Let us now show how to get a tighter bound using a fractional analysis. 
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Theorem A.3 (Improved Property B for Steiner Tree) IfT > T* then c($ T ) < • $* + 2 • T*. 

Proof: Firstly suppose |5| < k: then it is clear that there is a Steiner tree on {r} U S of cost at most <&* + T* , and 
the algorithm finds one of cost at most twice that. In the following assume that \S\ > k. 

Let LP(S) denote the minimum length of a. fractional Steiner tree on terminals {r} U S. Since each of the points in 
S is at least at distance f3 • ? > /3- ^ from each other, we get LP(S) > Jr • |5| • T*. We now construct a fractional 
Steiner tree x : E — > K + of small length. Number the terminals in S arbitrarily, and for each 1 < j < |5| let 
Aj = {j, j + 1, • • • , j + k — 1} (modulo Let TL; C E \ <3?* denote the second-stage edges bought in the 
optimal solution under scenario Af so $* U IL; is a Steiner tree on terminals {r} U Aj, and c(Uj) < T*. Define 

x := x(^*) + £ " Sj=i x(rij). We claim that x supports unit flow from r to any i G S: note that there are k sets 
Ai-k + i, • • • , A{ that contain i, and for each i — k + 1 < j < i, we have I ■ (x(&*) + x(rij)) supports \ flow from 
r to i. Thus x is a feasib 
lower bound on LP(S), 



I s\ 

r to i. Thus x is a feasible fractional Steiner tree on {r} U S, of cost at most <!>* + M • T*. Combined with the 



|5| • | • ^ < LP(5) < + l|l -T*. (A.3) 

Thus we have LP(S) < -J^ ■ which imphes the theorem since the minimum spanning tree on {r} U S costs 
at most twice LP (S). ■ 



From Claim A.l and [Theorem A.3 , we now get that the algorithm is (-g^rg, 2, f3) -discriminating. Thus, setting 



P = 2 — j + ^4 + 1/A 2 and applying Lemma 2.2, we get the following approximation ratio 



max <! 7T^T> T + Z 3 ) = 2 + T + V 4 + 



/3-2' A J A V A 2 ' 

On the other hand, the trivial algorithm which does nothing in the first stage is a 1.55 • A approximation. Hence the 
better of these two ratios gives an approximation bound better than 4.5. 

The /c-max-min Steiner Tree Problem. We show that the above algorithm can be extended to be (-J^,2,/3) 



strongly discriminating. As shown above, it is indeed discriminating. To show that Definition 2.3 holds, consider 



the proof of [Theorem A.3| when A = 1 (so <3>* = 0) and suppose that c($t) > 2T. The algorithm to output the 



A;-set Q proceeds via two cases. 

1. lf\S\<k then Q := S. The minimum Steiner tree on Q is at least half its MST, i.e. at least \c{^t) > T. 

2. If \S\ > k then Q C S is any fe-set; by the construction of S, we can feasibly pack dual balls of radius 
around each Q-vertex, and so LP(Q) > j3T > T. Thus the minimum Steiner tree on Q is at least T. 

A.2 Unrooted Steiner tree 



We note that the k -robust Steiner tree problem studied above differs from [ 27 ] since there is no root in the model 



of [|27p. In the unrooted version, any subset of k terminals appear in the second stage, and the goal is to connect 
them amongst each other. We show that a small modification in the proof implies that Algorithm 1 (where r G U 
is set to an arbitrary terminal) achieves a good approximation in the unrooted case as well. This algorithm is 



essentially same as the one used by [27], but with different parameters: hence our framework can be viewed as 



generalizing their algorithm. Our proof is somewhat shorter and gives a slightly better approximation ratio. 

Below, <I>* and T* denote the optimal first and second stage costs for the given unrooted instance. It is clear that 
|Claim A.l continues to hold in this case as well: hence Property A of Definition 2. 1 is satisfied. We next bound 



the first stage cost of the algorithm (i.e. Property B of Definition 2. 1 ) 



Theorem A.4 (Property B for Unrooted Steiner Tree) IfT > T* then c($ T ) < ^ • + 2T*. 
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Proof: Firstly suppose \S\ < k: then it is clear that there is a Steiner tree on S of cost at most <£* + T*, and the 
algorithm finds one of cost at most twice that. In the following assume that |5| > k. 

Let LP{S) denote the minimum length of & fractional Steiner tree on terminals S (recall, no root here). Since each 
of the points in S is at least at distance /3 • ? > /3 • ^ from each other, we get LP(S) > jr ■ \S\ ■ T*. We now 
construct a fractional Steiner tree x : E — > R + of small length. Number the terminals in S arbitrarily, and for each 
1 < j < \S\ let Aj = {j,j + !,■■■ + 1} (modulo \S\). Let Uj C E \ denote the second-stage edges 
bought in the optimal solution under scenario Ay. so <£* U IT, is a Steiner tree on terminals Aj, and c(Hj) < T*. 

Defines := X ($*) + | -E^iX(ni). 

Claim A.5 For any i £ S, x supports a unit flow from terminal i to i + 1 (modulo \S\). 

Proof: Note that there are k — 1 sets Ai_k+2, ■ ■ ■ , Ai that contain both i and Let J := {i — k + 2, • • • , i}. 
So for each j G J, we have £-(x($*) + x(IIj)) supports ^ flow from i to i+1. Furthermore, (Ui g 5\jri/)u$* 

is a Steiner tree connecting terminals Ui e g\jAi D + i.e. | • + S/gS\J also supports 

i flow from i to i + 1. Thus we obtain the claim. ■ 

Thus x is a feasible fractional Steiner tree on terminal S, of cost at most $ + M • T . Combined with the lower 
bound on LP(5), 



|^| ■ I • T 1 < i-P(-S) < + Jf- • T*. (A.4) 



Thus we have LP(S) < ' *^*> which implies the theorem since the minimum spanning tree on S costs at most 
twice LP(S). M 

Thus by the same calculation as in the rooted case, we obtain a result that slightly improves on the constants 



obtained by [ ]27[ ] for the same problem. 

Theorem A. 6 There is a 4. 5 -approximation algorithm for (unrooted) k-robust Steiner tree. 

B fc-Robust Set Cover with Non-uniform Inflation 

Consider the A; -robust set cover problem where there is a set system (U, {i?j}^! =1 ) with a universe of n elements 
and m sets with cost-vectors b, c € W? (for first and second stage resp.), and a bound k on the cardinality of the 



realized demand-set. The model considered in |Section 3| is the special case when c = Xb for some uniform inflation 
factor A. Here we consider the general case of set-dependent inflation, and show that the same result holds. We 
may assume WLOG that the first-stage cost for each set is at most its second-stage cost, i.e. b < c. (If some set R 
has cr < bR, then we pretend that its first-stage cost is cr; and if R is chosen into the first-stage solution it can be 
always bought in the second stage). 



Under non-uniform inflations, the definition of an (a\ , ai , /^-discriminating algorithm is the same as Definition 2. 1 
where Condition B is replaced by: 

B'. Let <!>* denote the optimal first stage solution, and T* the optimal second stage c-cost (hence the optimal 
value Opt = &($*) + T*). If the threshold T > T* then the first stage cost b($ T ) < a x ■ + a 2 • T*. 



It can be shown exactly as in |Lemma 2"X that any such algorithm is a max{ai, «2 + /?} -approximation for k- 



robust set cover. Note that the factor 012 was scaled down by A in the uniform inflation case ( jLemma 2.2[ ). The 



algorithm and analysis here are very similar to that for fc-robust set-cover under uniform inflation (Section 3). 



We will show that this algorithm is (H n , l2H n , 36 In m)-discriminating. The following claim is immediate. 

Claim B.l (Property A) For all T > and oj C U, the sets $r (J Augment T (a;) cover elements uj; additionally 
if\w\ < k then the cost c(Augment T (o;)) < f3T. 

Theorem B.2 (Property B') Assume f3 > 36 • mm. IfT > T* then 6($ T ) < H n ■ + 12 • T*). 
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Algorithm 6 Algorithm for Cardinality Robust Set Cover with non-uniform inflation 



1: input: robust set-cover instance and bound T. 

2: let j3 <— 36 • In m, and S <— {v G U | minimum c-cost set covering v has cost at least /?■?}• 

3: output first stage solution &t as the Greedy-Set-Cover(S') under 6-costs. 

4: define Augment T ({i}) as the minimum c-cost set covering i if i G U \ S, and otherwise. 

5: output second stage solution Augment r where Augment r (u;) := (Jig w Augment T ({i}) for all uj C fj. 



Proof: We will show that there is a. fractional solution x for covering 5 with small 6-cost, at most &(<!>*) + 12 • 
T* , whence rounding this to an integer solution implies the theorem. For a contradiction, assume not: let every 
fractional set cover be expensive, and hence there must be a dual solution of large value. 

Let S' C S denote the elements that are not covered by the optimal first stage <3?*, and let T' C T denote the sets 
that contain at least one element from S'. By the choice of S, all sets in T' have c-cost at least /3 ■ ? > ft ■ tt- 
Define the "coarse" cost for a set R £ J 7 ' to be cr = [~ 6 ff/fc l- For each set R G J 7 ', since cr > > it 
follows that cj? • G [cr, 2 • cr), and also that cr > /3/6. 

Now consider the LP for the set cover instance with elements 5' and sets T' having the coarse costs c. Let 
{xrIrgF' be an optimal fractional solution; then |Claim 3.3| applies directly to yield: 

^c R -x R <2-k (B.5) 



Given the primal LP solution {xr}r € jti to cover elements in S', define a fractional solution z covering elements 
S as follows: define zr = 1 if R G <&*, zr = xr if R G J 7 ', and zr = otherwise. Since the solution z contains 
$* integrally, it covers elements S \ S' (i.e. the portion of 5 covered by $*); since zr > xr for all R G J 7 ', z 
fractionally covers S' . Finally, the 6-cost of this solution is: 

6T* 

b ■ z = &($*) + b • x < + c • x < + — • (c • x) < &($*) + 12 • T*, 



where the second inequality uses b < c, the next one is by definition of c and the last inequality is from (B.5). Thus 
we have an LP solution of 6-cost <£* + 12T*, and since the greedy algorithm is an ^-approximation relative to 
the LP value, this completes the proof. ■ 

Thus we obtain: 

Theorem B.3 There is an 0(log m + log n)-approximation for k-robust set cover with set-dependent inflations. 
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